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Abstract. XPath is a simple language for navigating an XML-tree and returning a set 
of answer nodes. The focus in this paper is on the complexity of the containment prob- 
lem for various fragments of XPath. We restrict attention to the most common XPath 
expressions which navigate along the child and/or descendant axis. In addition to basic 
expressions using only node tests and simple predicates, we also consider disjunction and 
variables (ranging over nodes). Further, we investigate the containment problem relative 
to a given DTD. With respect to variables we study two semantics, (1) the original se- 
mantics of XPath, where the values of variables are given by an outer context, and (2) an 
existential semantics introduced by Deutsch and Tannen, in which the values of variables 
are existentially quantified. In this framework, we establish an exact classification of the 
complexity of the containment problem for many XPath fragments. 



XPath is a simple language for navigating an XML document and selecting a set of 
element nodes .9.- At the same time it is also the main XML selection language. Indeed, 
XPath expressions are used, for instance, as basic patterns in several XML query languages 
like XQuery [2j and XSLT |3J^j; they are used in XML Schema to define keys and 
in XLink and XPointer J2] to reference elements in external documents. In every such 
context an instance of the containment problem is present: optimizing XPath expressions 
can be accomplished by an algorithm for containment, and XSLT rule selection and inference 
of keys based on XPath expressions again reduces to containment. In this article we focus 
on the complexity of the containment problem of various fragments of XPath 1.0 using only 
the most common axes, / and //, and extensions in which variables can refer to data values. 
Furthermore, the containment problem relative to a given DTD is investigated. In all cases, 
we only consider the Boolean containment problem. Here, given two XPath expressions p 
and q, the problem asks whether the fact that p selects some path from the root to a vertex 
implies that also q selects some path from the root to a vertex. This restriction is justified 
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as all complexity results we obtain easily transfer to unary and binary containment in the 
spirit of Proposition 1 in j27j . 

The XPath containment problem already attracted quite some attention jl41 1271 1281 1361 
1371 124j ■ We next discuss the known results together with our own contributions. 

A general result establishing a strong upper bound for a large fragment of XPath is due 
to Marx presented in 24,. It is shown there that the containment problem for navigational 
XPath, allowing navigation along all axes, even relative to a DTD, is in exptime. 

Other work has concentrated, like this article, on XPath expressions that can only 
navigate downwards in an XML tree and do not use the order between siblings, i.e., the 
fragment using only the / and // axis. Different fragments can be defined by allowing or 
disallowing the use of the wild-card * in node tests, and filter predicates in location steps. 
In the spirit of the abbreviated syntax of XPath we use / to indicate the use of the child 
axis, // for the descendant axis, * for the wild-card and [] for filter predicates. We denote 
XPath fragments by listing the allowed operators. For instance, XP(/, //,[]) denotes the 
XPath fragment with the child and descendant axes in which the use of filter predicates 
is allowed, but no wild-cards in node tests. 

Among other results, Miklau and Suciu [2Z| obtain that containment for XP(/,//, [],*) 
is CONP-complete fZIj . Here, inside filter predicates and between location steps, no Boolean 
operators are allowed. 

Contributions. The first family of fragments we consider is obtained by allowing disjunction 
(I) in filter predicates and in location steps. We show that, in principle, adding disjunction 
to XP(/,//, [],*) does not make the containment problem harder. Surprisingly, when the 
set of allowed element names (labels) in XML documents is restricted, and given as part of 
the input then the containment problem becomes much harder: complete for PSPACE. The 
results on fragments with disjunction are shown in Tabled 



/ 


// 


[] 




* 


Complexity 


Reference 


+ 


+ 


+ 




+ 


CONP-complete 






+ 


+ 


+ 


+ 


+ 


CONP-complete 






+ 






+ 




CONP-complete 








+ 




+ 




CONP-complete 




iiim.n 


+ 


+ 


+ 


+ 


+ 


PSPACE-complete 


;iven alphabet) 




+ 


+ 




+ 




PSPACE-complete 


;iven alphabet) 





Table 1: The complexity of containment for expressions with disjunction. Square brackets 
refer to the references, parentheses to results of this article. 

Deutsch and Tannen consider XPath containment in the presence of DTDs and 
Simple XPath Integrity Constraints (SXICs) |14j . Here, the input to the containment 
problem consists of two XPath expressions p, q, a DTD and/or a set of integrity constraints 
and it is asked whether p selects a subset of the elements that q selects, in all documents 
respecting the DTD and/or the constraints. They show that this problem is undecidable in 
general and in the presence of bounded SXICs and DTDs. When only DTDs are present 
they have a pspace lower bound and leave the exact complexity as an open question. 

We indicate the presence of a DTD by XP(DTD,...). We give a simple proof that con- 
tainment testing for XP(DTD,/,//,[],*,|) is in exptime (although this result is covered by 
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the above mentioned result of Marx |2l]) and obtain that containment for XP(DTD, /,//,[) 
and for XP(DTD, /,//,[],*) are hard for EXPTIME. We also study the complexity of more re- 
strictive fragments in the presence of DTDs. It turns out that containment of XP(DTD,/,//) 
is in PTiME. On the other hand, containment of XP(DTD,/,[]) is CONP-complete and con- 
tainment of XP(DTD, //,[]) is CONP-hard. It is not clear whether or how the upper bound 
proof in the former case can be extended to include, for instance, the descendant operator. 
The results about the containment problem in the presence of DTDs are summarized in 
Table m 
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Table 2: The complexity of containment in the presence of DTDs. 



The XPath recommendation allows variables to be used in XPath expressions on which 
equality tests can be performed. For instance, jja^x = @b][$y ^ @c] selects all a- 
descendants whose 6-attribute equals the value of variable $x and whose c-attribute differs 
from the value of variable $y. However, under the XPath semantics the value of all variables 
should be specified by the outer context (e.g., in the XSLT template in which the pattern 
is issued). We indicate the use of variables with XPath semantics by XP(...,xvars,...). So 
the semantics of an XPath expression is defined with respect to a variable mapping. We 
show that the complexity of containment is PSPACE-complete under this semantics. For the 
lower bound, it suffices to observe that with variables a finite alphabet can be simulated. 
We obtain the upper bound by reducing the containment problem to the containment of 
several expressions without variables. 

In addition to the XPath semantics, Deutsch and Tannen ^1] considered an existential 
semantics for variables: an expression matches a document if there exists a suitable assign- 
ment for the variables. We denote variables with existential semantics by XP(...,evars,...). 
In it is shown that containment of XP(/,//,[],*,evars) and XP(/,//,[],|,evars) is 11^- 
hard, and that containment of XP(/,//, [],|,evars) under fixed bounded SXICs is in n|'. We 
extend their result by showing that containment of XP(/,//,[],|,evars,7^), that is, inequality 
tests on variables and attribute values are allowed, remains in n^'. Surprisingly, the further 
addition of * to this fragment makes the containment problem undecidable. The results 
about XPath containment for fragments with variables are indicated in Table 01 

Further related work. In j37j . Wood shows that containment of XP(/, //,[],*) in the presence 
of DTDs is decidable. He also studies conditions for which containment under DTDs is in 
PTIME. Benedikt, Fan, and Kuper study the expressive power and closure properties of 
fragments of XPath j2j . They also consider sound and complete axiom systems and normal 
forms for some of these fragments. Hidders, and Benedikt, Fan, and Geerts considered 
the complexity of satisfiability of XPath expressions jl8| ^ . The complexity of XPath 
evaluation has been studied by Gottlob, Koch, and Pichler in ^lEli while its expressive 
power has been addressed by Marx in [22] and |22]. This article is based on [3(1] . 
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Table 3: The complexity of containment in the presence of variables. Note that xvars and 
evars refer to the original XPath semantics and to existential semantics, respec- 
tively. 

Organization. This article is organized as follows. In Section El we define DTDs and the 
basic XPath fragments. We also introduce the necessary machinery w.r.t. unranked tree 
automata. In SectionlHlEl and^lwe consider disjunction, DTDs, and variables, respectively. 
We conclude in Section IHl 

2. Preliminaries 

In this section, we define the tree abstraction of XML documents, DTDs and the frag- 
ments of XPath that we consider. 

2.1. XML-trees. For the rest of this paper, we fix an infinite set S of labels and an 
infinite set D of data values. Only in Section 13.21 we consider XPath expressions over 
a finite alphabet. The set A is always a finite set of attributes. An XML document is 
faithfully modeled by a finite unranked tree with labels from S in which the attributes of 
the nodes have D-values and in which the children of each node are ordered. 

It is common to model the underlying tree as a tree domain. To this end, the edges 
connecting a node with its children are numbered from 1 to n, according to the ordering of 
its children. Each path from the root to a node then corresponds to a sequence of numbers. 
Finally, each node is identified with this sequence. In particular, the root, which corresponds 
to the document node 9_, is represented by the empty string denoted by e. 

More formally, a tree domain D is a finite subset of N* with the following closure 
properties: 

• If • i G D, where f G N* and i G N, then v ^ D. 

• If i > 1 and v ■ i D, then also v ■ {i — 1) D. 

We call the elements of D vertices. A vertex vi with i G N is a child of a vertex v. 
Conversely, v is called the parent oi vi. A vertex vu with u G N"*" is a descendant of v. 
We also say that v is an ancestor of vu. 
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Definition 2.1. An XML-tree (tree for short) is a triple t = (dom(t), labt, Aj), where 
dom(t) is a tree domain over N, and labj : dom(t) S and, for each a £ A, : dom(t) — > D 
are partial functions. Intuitively, laht{v) is the label of v, while A"(w) is the value of w's 
a-attribute, if it has one. 

Of course, in real XML documents there can be vertices with mixed content, but these 
can easily be modeled by using auxiliary intermediate nodes as explained in iSj. We consider 
attributes only in Section |SJ 

For a vertex v € dom(t), we denote by the sub-tree of dom(t) rooted at v. As a 
tree domain in itself, this is the set {w \ vw G dom(t)}. 

2.2. DTDs. We formalize Document Type Definitions (DTDs) as context-free grammars 
with regular expressions on the right-hand side of rules. As usual, we denote by L(r) the 
language defined by the regular expression r. 

Definition 2.2. A DTD is a tuple {d,Sd,'Sd) where is a finite subset of S, Sd € 
is the start symbol, and d is a mapping from to the set of regular expressions over S^. 
A tree t matches a DTD d iff lab((e) = Sd and for every u £ dom(t) with n children, 
laht{ul) ■ ■ ■ \aht{un) € L{d{lab{u))). We denote by L{d) the set of all trees that match d. 

Note that DTDs do not constrain the value of attributes in any way. We usually refer 
to a DTD by d rather than by (d, Sd,'^d)- 

2.3. XPath. We next define the core fragment of XPath that we will consider in Sections|21 
and 12 In our definition we follow Marx f53'. In Sectional we consider a larger fragment 
which allows the use of attribute values. 

Definition 2.3. An XPath expression is generated by the following grammar: 



Here, Ipath is the start symbol which is short for location path; node-test is either a label 
or the wild-card 

We write \p\ for the size of an XPath expression, which is the total number of occur- 
rences in p of axes child and descendant (including those in filter expressions). 
We use U to denote a big disjunction of expressions. 

Note that, as we only consider expressions which navigate downwards in the tree, we 
do not allow absolute location paths, i.e., paths requiring to be evaluated from the root. For 
convenience, we further assume that location steps with the self-axis only occur at the 
top-level. This is no loss of generality, as one can always translate a location path of the 
form 



Ipath 
Istep 



fexpr 



axis 



Istep I Ipath '/' Ipath | Ipath '|' Ipath 
axis '::' node-test ('['fexpr']')* 
self I child | descendant 
Ipath I Ipath 'or' Ipath 



axis :: a[ei] ■ ■ ■ [efc]/self :: a'[fi] ■ ■ ■ [fe] 



into 



axis :: -f[ei] ■ ■ ■ [ek][h] ■ ■ ■ [fi] 



where 




(J if cj' = *; and, 

a' \i a = a' or a = *. 
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Of course, when a ^ a' are two labels in the above expression, then it is unsatisfiable. 

For notational brevity, we often use abbreviated syntax for XPath expressions [S]. Thus, 
instead of 

child:: a [child:: [descendant:: e]/descendant:: c, 

we simply write a[6][.//e]//c. Note that the sub-expression .//e is the abbreviated notation 
for descendant-or-self :: * / child:: e and thus accounts for descendant:: e. 

For the definition of the semantics of XPath expressions we again basically follow [22] . 

Definition 2.4. For each tree t, each location path p induces a binary relation which 
is inductively defined as follows: 

• \a :: n[ei] • • • [ek\lt is the set of all pairs {u,v) for which all the following conditions 
hold: 

— if a is child then v \s a, child of u] 

— if a is descendant then w is a descendant of u] 

— if a is self then v = u; 

— if n is a label then laht{v) = n, otherwise n is * and the label of v can be 
arbitrary; and, 

— Et{v, Bi) is true, for each i < k. Here, Et is defined as follows: 

* Et{v,p) is true for a vertex v and a location path p if and only if there is 
a vertex w such that {v,w) G Ipjf 

* Et{v, ei or 62) is true for a vertex v if and only if Et{v, ei) or Et{v, 62) is 
true. 

• Ip/^I* = Iplt ° IqIi (where o denotes the composition of binary relations); and, 

• b I Qh = Mt u Mt. 

So, the semantics definition associates with every tree t and every expression p a binary 
relation. When the context vertex, i.e., the first vertex in pairs, is fixed to be the root then 
every expression defines the set {v \ {£,v) G Ipjt}- Recall that e denotes the root of a tree. 
We say a tree t matches an expression p (written: t \= p) ii there is some vertex v'mt such 
that (e,f) G Ipjt- In the latter case we interpret an expression p as a Boolean query. 



Symbol 


Meaning 


/ 


child axis is allowed 


// 


descendant axis is allowed 


[] 


filter expressions are allowed 




disjunction ('or' and ) is allowed 


* 


wild-cards are allowed 



Table 4: Symbols used in the notation for XPath fragments. 

We denote sub-fragments of the above defined XPath fragment using the abbreviated 
syntax. We use the notations explained in Table 21 If filter expressions are not allowed, 
location steps are only of the form axis :: node-test. Disjunction is allowed in location paths 
and in filter expressions. If wild-cards are not allowed, every node-test has to be a label. 

We denote XPath fragments by XP(...) where inside the brackets the allowed features 
are listed. For instance, we write XP(/,[],|) for the fragment, where the wild-card is not 
allowed and the descendant axis can not be used. 

In some proofs, we view expressions p from XP(/, //,[],*) as tree patterns as described 
by Miklau and Suciu ^7. For example, the expression a/b//c[d][*/e] corresponds to the tree 
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pattern in Figure ^ Single edge and double edge correspond to the child and descendant 
axis, respectively. We denote the tree pattern associated with an expression p by t{p). 

From this point of view, t \= p ii and only if there is a homomorphism h from t{p) to t, 
i.e., h maps the nodes of h(p) to the nodes of t such that (1) h{v) has the same label as v 
unless V carries a wild-card, (2) h{v) is a child (descendant) of h{u) if and only if is a child 
(descendant) of u. So, h respects labels, child and descendant, and does not care about *. 

Every tree pattern has one selecting node: all pairs {u, v) of nodes of the input tree are 
selected, for which the root of the tree pattern can be mapped to u and the selecting node 
to V. In Figure n the selecting node is labeled by x. 

a 



c X 




Figure 1: The tree pattern corresponding to a/b//c[d][*/e] 

We will frequently make use of the fact that every expression p from XP(/,//,[],*,|) can 
be written in disjunctive normal form, i.e., in the form pi | • • • | Pn, where each pi is an 
expression from XP(/, //,[],*). It should be noted that n can be exponential in \p\. 

2.4. Containment. Corresponding to the three different interpretations of an XPath ex- 
pression as a binary, unary or Boolean query, there are three different notions of query 
containment. 

Definition 2.5. For two XPath expressions p and q, 

• p is contained as a binary query in q, denoted by p Q2 if Ipjt ^ iojt, for every 
tree t; 

• p is contained as a unary query in q, denoted by p if, for each tree t and 
each vertex v of t, {e,v) E {pjt implies (e,f) G Iqjt] and 

• p is contained as a Boolean query in q, denoted p ^ q, if, for every tree t, t \= p 
implies t \= q. 

Definition 2.6. XContainment is the algorithmic problem to decide for two XPath ex- 
pressions p and q, whether p ^ q. 

For a DTD d and two XPath expressions p and g, by p Qd we denote that t \= p 
implies t \= q, for all trees t that match d. 

Definition 2.7. XD Containment is the algorithmic problem to decide for two XPath 
expressions p and q and a DTD, whether p q. 

The restriction to Boolean containment is justified: it is shown in j2Zl that for XP(/,//, [ ] ,*) 
the complexity of deciding binary (or fe-ary, for any k) containment is the same as the com- 
plexity of deciding Boolean containment. For binary queries, this result can be generalized 
to all fragments we consider in this article. More precisely: all complexity results stated in 
this article hold also for unary and binary containment. First of all, binary and unary con- 
tainment are computationally equivalent in our framework as from a node v expressions can 
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only navigate inside the subtree induced by v. Furthermore, to get from unary to Boolean 
containment, it is easy to see that the result in only requires the child axes, i.e., only 
one node is added as a child of the selecting node of the tree pattern. It is straightforward to 
get the same statement also in the presence of a DTD. For the fragment XP(//,|) a similar 
approach works: for the label a of the selecting node a new label a' is introduced and every 
reference of a not-selecting node to a is replaced by a\a' . The lower bound we prove for 
XP(//,[]) also goes through for the binary and unary case. 

2.5. Unranked tree automata. We recall the definition of non-deterministic tree au- 
tomata over unranked trees from |^. These are used in the proofs of Theorem 14.11 and 
Theorem 14.41 We refer the unfamiliar reader to 12^ for a gentle introduction. The alter- 
nating automata of Section operate over ranked trees. 

Definition 2.8. A nondeterministic tree automaton (NTA) is a tuple A = ((5,A, 
(5, F), where Q is a finite set of states, A is a finite alphabet, F C Q is the set of final states, 
and 5 is a function 5 : Q x A — > 2^*^ such that 5{q^a) is a regular string language over Q 
for every a € A and q £ Q. 

A run of A on a tree i is a labeling A : dom(t) Q such that for every v E dom(t) with 
n children we have that X{vl) ■ ■ ■ X{vn) € 6{X{v),lah*{v)). Note that when v has no children, 
the criterion reduces to e G 5(A(f ), lab*(u)). A run is accepting iff the root is labeled with 
an accepting state, that is, A(e) £ F. A tree is accepted if there is an accepting run. The 
set of all accepted trees is denoted by L{A). 

The regular languages encoding the transition function of an NTA are represented by 
NFAs. The size of an NTA is then |Q| + |A| plus the sizes of the NFAs. 

A deterministic tree automaton (DTA) is an NTA where 6{q, a) R 5{q', a) = for 
all a G A and q ^ q' £ Q- The transition function of a DTA is represented by DFAs. 

The following is well known. 

Lemma 2.9. (1) Deciding whether, for a given NTA A, L{A) = is in ptime. ^Tl 
Theorem 19] 

(2) Testing whether, for given DTAs A and B, L{A) C L{B) is in ptime. Theorem 
3] 

(3) Given a DTD d, a DTA A^ such that L{d) = L{Ad) can be constructed in exponential 
time, (essentially, 

3. Containment in the presence of disjunction 

Miklau and Suciu showed that XContainment for XP(/, //,[],*) is CONP-complete fl7| . 
In this section, we consider the addition of disjunction to this fragment and show that 
XContainment remains in CONP. The problem remains hard for CONP even if only the 
child or only the descendant axis is allowed together with disjunction. Miklau and Suciu 
already mention these results but do not provide full proofs |27| (with exception of the lower 
bound for XP(/,|)). Therefore, we decided to include the proofs in this paper. 

We were surprised by the fact that the CONP upper bound strongly depends on the fact 
that the alphabet S is infinite. Let XFContainment be the containment problem, where 
additionally a finite set of labels is given as input and the containment only has to hold for 
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documents with labels from this set. In Theorem 13. 3| we show that even for XP(/,//,|) the 
problem XFCONTAINMENT is hard for PSPACE. 

3.1. Unrestricted alphabet. 

Theorem 3.1. XContainment for XP(/, //,[], *, |) expressions is in CONP. 

Proof. We develop a criterion which allows to check in np whether, for given expressions 
p and q, p % q- Let p and q be fixed and let pi\. . . \pi and qi\. . . \qi> be the disjunctive 
normal forms (DNFs) of p and g, respectively. Hence, each pi and qj is an expression from 
XP(/, //, [],*). Let n and m denote the maximum number of nodes in an expression pi and 
qj, respectively. Let T{n,m) be the set of trees with at most 2n(m + 2) nodes that are 
labeled with symbols that occur in p and with the new symbol 7^ not occurring in p nor in 
q. We prove the following claim: 

Claim, p ^ q there is a t G T{n, m) such that t\=p but q. 

Clearly "<^=" holds. For the other direction we assume that there is a tree s matching 
p but not q. Then s has to match one of the pi. Hence, there is a homomorphism / from 
Pi to s. i.e., h maps the nodes of the tree pattern oi pi to the nodes of s such that (1) h{v) 
has the same label as v unless v carries a wildcard, (2) h{v) is a child (descendant) of h[u) 
if and only if 7; is a child (descendant) of u. 

We construct t by transforming s in several steps. Let V denote the set of nodes of s 
in the image of /. We delete all nodes in s that are neither in V nor an ancestor of a node 
in V . The resulting tree, ti, has at most as many leaves as pi. We replace the labels of 
those nodes of ti which are not in y by # and obtain t2- Let V be the set of branching 
nodes of i.e., those nodes that have more than one child. The set V contains at most 
n vertices. Let a pure path of ^2 be a path without nodes from V yjV . In particular, the 
nodes of a pure path are all labeled with We get t by replacing in t2 each maximal pure 
path with > m + 1 inner nodes by a path with m + 1 7^-labeled inner nodes. We refer to the 
nodes of t which are inserted in this last step as special nodes. Clearly, there is a one-to-one 
correspondence between non-special nodes in t2 and t. For a non-special node u in t, the 
corresponding node in t2 is denoted by u. 

It is easy to see that t G T(n, m), that t\= pi and that t contains at most m + 2 times 
\V\ + \V'\, hence < 2n{m + 2), many nodes. 

We have to show that t ^ q. Towards a contradiction assume that t \= qj, for some 
j. Hence, there is a homomorphism h : qj ^ t. Next, we show that h can be modified 
to obtain a homomorphism from qj to s which leads to the desired contradiction. We first 
define a homomorphism /12 from qj to ^2 as follows. Whenever h{v) is a non-special node 

then h2{v) = h{v). 

Let vi, . . . ,Vk be nodes of qj such that h{vi), . . . , h{vk) are special nodes which lie on 
some path of t2 which consists entirely of special nodes, ordered from the root to the leaves. 
By the choice of m it holds that k < m, therefore there must be an i such that h{vi^i) is 
not a child of h{vi) or h{vi) is not the first node of the path or h{vk) is not the last node 
of the path. 

In either case, we can define /12 for nodes from {vi, . . . ,Vk} such that the child and 
descendant relations are respected. In this way, we get a homomorphism h2 from qj to t2- 
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Clearly, all nodes of qj which are not mapped to nodes in V must be labeled with a *. 
Thus, /i2 also defines a homomorphism from qj to ti and to s, the desired contradiction. 
This completes the proof of the claim. 

It remains to show how the criterion of the above claim can be used for an NP-algorithm 
that checks whether p % q. The algorithm simply guesses an expression pi from the DNF 
of p (by non-deterministically choosing one alternative for each | in p) and a t G T{n,m). 
Then it checks that t \= pi and t ^ q. The latter can be done in polynomial time as shown 
in [m. □ 

Theorem 3.2. (a) XCONTAINMENT for XP(/, |) is CONP-hard. 
(b) XCONTAINMENT for XP(//, |) is CONP-hard. 

Proof, (a) The hardness proof is the same proof that shows that containment of regular 
expressions is CONP-hard [2DJ. We give it for completeness sake and because the next 
proof depends on it. We use a reduction from validity of propositional logic formulas in 
disjunctive normal form which is known to be complete for CONP |2Uj . Let if = V2=i 
be a propositional formula in disjunctive normal form over the variables 
Here, each Cj is a conjunction of literals. For a disjunct C let C be the expression 
o-i / ■ ■ ■ /cLn where 

if -iXj occurs in C ; 

1 if Xi occurs in C; 
(0|1) otherwise. 

Let q be the disjunction of the expressions Cj, i = l,...,m. Further, let p be the 
expression (0|1)/ • • • /(0|1) where (0|1) is repeated n times. Clearly, p C g iff is valid, 
(b) The reduction is similar to the one above except that we define C as axlja^H ■ ■ ■ //flru 
q as the disjunction of the expressions Cj, i = 1, . . . ,m, and p as (0|1)// • • • //(0|1). We 
show that p Q q <^ ip is valid. Suppose p q, then in particular q matches every 
0-1-string of length n, hence, if is valid. To prove the converse direction, suppose if is 
valid. If p matches a branch in a tree then there are in particular n positions with 
or 1. The i-th such position can be seen as a truth assignment to Xj. As if is valid all 
possible assignments are accounted for by q, and q matches that branch. □ 



3.2. Finite alphabet. As mentioned above, when the alphabet is finite, and given as part 
of the input, containment becomes much harder. In the rest of this section, S is therefore 
a finite alphabet. 

Theorem 3.3. XFContainment for XP(/,//, |) is PSPACE-hard. 

Proof. We make use of a reduction from corridor tiling which is known to be hard 
for pspace |H]. Let T = {D, H,V,b,t,n) be a tiling system. Here, D = {ai,...,ak} is 
a finite set of tiles; H,V C D x D are horizontal and vertical constraints, respectively; 
b = (hi, ... , bn) and t = (ti, . . . ,tn) are re-tuples of tiles; and, re is a natural number in 
unary notation. The question is whether there exists a number m and a valid tiling of a 
board with re columns and m rows. Here, a tiling is valid if the following conditions are 
fulfilled: 

• The bottom row is tiled with b. 

• The top row is tiled with t. 
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• For each horizontal pair (x, y) of tiles, (x, y) € H. 

• For each vertical pair {x,y) of tiles {y above x), {x,y) G V. 

We use a string representation of the board where every row is delimited by ^ and the 
last symbol is $. The expression q selects all strings that do not encode a tiling. As S we 
take D U {#, $}. For S = {ci, . . . , Cn} C S, we abbreviate the expression (ci | • ■ ■ | Cn) by 
S. For an expression r, r* denotes r j ■ ■ ■ jr with i occurrences of r. The expression p is .//$ 
assuring that the string contains the symbol $. The expression q is the disjunction of the 
following expressions. 

• some row has the wrong format: 

— some inner row has too few tiles: ljr=ro^ •//#-^*t^ 

— the first row has too few tiles: Ur=o^ 

— the last row has not enough tiles: Ur=o^ -l/H^D^ /'^ 

— some row has too many tiles: .jjD^^^^ 

• $ occurs inside the string: .//%/{D U {$} U {#}); 

• the string does not begin with h: IJiLi ^1/ ' ' ' /^«-i/(Ua 7^6^ ^i)' 

• the string does not end with i: IJILi •// ^aj^u ' ' " /^"/^ 

• some vertical constraint is violated:|J^^^ d2)^v •//'^i/(-^ U {#})"/(i2; and, 

• some horizontal constraint is violated: U(di,d2)0i^ 

.//di/d2. 

Now, T has a solution iff p ^ (7. Clearly, if T has a solution then we can take the string 
encoding of the tiling as a counter example for the containment of p and q. Conversely, if 
p % q then there is a, not necessarily unary, tree t with one branch s ending on a $ such 
that s \= p and q. So, this branch encodes a solution for T. □ 

Actually, in the proof of Theorem l.S.31 the restriction to a finite alphabet is only used to 
express that a certain element name in the XML document does not occur in a certain set. 
Therefore, if we extended the formalism with an operator for a finite set S, expressing 
that any symbol but one from S is allowed, then containment would also be hard for pspace. 

For the upper bound we need the notion of alternating tree automata 33_ which is 
defined next. These automata operate on trees where every node has rank at most k (for 
some fixed k). That is, every node has at most k children. 

Definition 3.4. An alternating tree automaton (ATA) is a tuple A = {k,Q,T,,qQ,6) 
where k > 0, Q is a finite set of states, S is the finite alphabet, qo € Q is the initial state, 
and 6 : Q X T, X {0, 1, . . . , fc} — > B+dO, 1, . . . , A;} x Q) is the transition function. Here, 
B"'"({1, . . . ,k} X Q) denotes the set of positive Boolean formulas over the set {1, . . . , k} x Q. 

A configuration on a tree i is a tuple [u, q] where u £ dom(t) and q (z Q. An accepting 
run of yl on t is a tree s where nodes are labeled with configurations such that the root of 
s is labeled with [e,go]i where e is the root of t and, for every node u of s (including leaf 
nodes), the following local consistency condition holds. Let u be labeled with [v,q] with n 
children labeled [vi, qi], . . . ,[vn, qn]- Then it must hold that 

• each Vi is a child v or v itself; and, 

• 5{q, labt(u), m) is satisfied by the truth assignment p, where m <k\s the number of 
children of v in t, and p{{i,q')) is true if for some i, qi = q' and Vi is the i-th child 
of V (where we view v itself as the 0-th child) . 

A tree is accepted by A if there is an accepting run. By L{A) we denote the set of trees 
accepted by A. 
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Note that ATAs as we defined them do not have final states. These are encoded by 
transitions of the form 5{q, a, 0) = true. 

Theorem 3.5. XFCONTAINMENT for XP(/, //, [], *, |) is in PSPACE. 

Proof. We show first that p % q implies that there is a counter example tree with small 
degree and only a few branching nodes. More precisely, we call a tree A;-bounded if it has 
at most k non-tmary nodes (that is, nodes with more than one child) and every node has 
rank at most k. For an XP(/, //, [], *, |) -expression p let f{p) be the maximum number of 
filter expressions in any disjunct of the DNF of p. We claim that p C g if and only \it\= p 
implies t\=qon the class of /(p)-bounded trees. 

Indeed, suppose there is a t such that t \= p and t ^ q. Let the DNF of p and q be 
Pi\ - ■ ■\Pn and qi\ - ■ ■ |gm, respectively. Thus, for some i, t\= pi, but t ^ qj, for all j. Let h 
be a homomorphism from pi to t and let s be the tree obtained from t by deleting all nodes 
that are neither in the image of h nor ancestors of such nodes. Clearly, s is /(p)-bounded, 
s \= Pi and s ^ qj for all j (otherwise, t \= qj). 

Next, we show that for every XP(/,//, [],*, |) expression p there is an ATA Ap such 
that for every /(p)-bounded tree t, t \= p iS Ap accepts t. Moreover, Ap can be constructed 
in LOGSPACE. 

To this end, let p be an XP(/, //, [], *, |) expression. As the alphabet is finite and fixed, 
we can replace every * with a disjunction of the alphabet symbols. Hence, we assume p 
does not contain *. Let k = f(j>)- We define Ap = {k,Q,T,,qQ,6) where qo = p and Q is 
the set of sub-expressions of p, all filter expressions of p and all node tests of p. Intuitively, 
a pair [v, q] in an accepting run of Ap on a tree t means that q holds in the sub-tree of t 
rooted at v. For all m < k, the transition function is inductively defined as follows: 

• 6{sel± :: a[ei] ■ ■ ■ [ee]/p' , a,m) = (0,/) A ALi(0, e^; 

• ,5(child :: a[ei] • • • [ei]/p' ,r,m) = Vjli[0» A ij,p') A Af=i(i,e,)]; 

• (5(descendant :: cr[ei] • • • [ei\/p' ,T,m) = 

Vf=i(i> [descendant :: a[ei] ■ ■ ■ H/p'lt V {{j, a) A (j,p') A Af=i(j, Si)); 

• 6{a,a,m) = true; 

• S{p I q,a,m) = (0,p) V (0,g); and, 

• 6{ei I 62, a, m) = (0, ei) V (0, 62). 

The combinations d{p,a,m) that are not mentioned are false. For location paths of 
length 1, i.e., if in one of the first three transitions there is no p', the atoms (0,p') or 
{j,p') are removed, respectively. It is straightforward to prove by a nested induction on 
the structure of the tree and the expression that the pairs [v,q] have the intended meaning. 
Therefore, a tree t has an accepting run of Ap if and only if t \= p. 

Thus, to decide p C g it is sufficient to test whether every /(p)-bounded tree accepted 
by Ap is also accepted by Aq. Note that /(jo)-bounded trees can be easily encoded by 
strings. We say that a node is a fork if it has more than one child. As there are at most 
f{p) forks, every tree consists of at most k := f(j)) x f{p) unary paths that are joined at 
the at most f{p) forks. To every path we associate its lower fork (or none if there is no 
such fork). Next, we assign a unique number to each path such that higher paths get lower 
numbers. Let Sj be the concatenation of the labels on path i and let ii, . . . be the paths 
rooted at the fork below branch i. Let Oj := Sj ii • • • Every /(p)-bounded tree t can 
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then be encoded by the string ai ■ ■ ■ am- Let A'p and A'^ be the alternating string automata 
that simulate Ap and Aq on the string representations of bounded trees. Basically, whenever 
the automaton reaches the end of (the encoding of) a path, the numbers ii, . . . ,i£ indicate 
the positions of the children paths and it can reach a path ij by skipping everything before 
its occurrence. This is possible as all paths are ordered and higher numbered paths occur 
to the right. Finally, let A be the automaton that checks whether the input string is a valid 
encoding of a bounded tree. The problem then reduces to testing whether ^ n PI ^A'^ is 
empty. The latter can be done in pspace We arrive at the desired result. □ 

4. Containment in the presence of DTDs 

In this section we study the XDContainment problem, i.e., the containment problem 
relative to a DTD. Deutsch and Tannen show a pspace lower bound for XDCon- 
tainment for XP(/, //,[],*, I). Marx gives an EXPTIME upper bound for a much larger 
fragment, including all axes and negation in filter expressions. 

We show here that XDContainment for XP(/,//,[],*,|) problem is actually exptime- 
complete. We also exhibit a simple fragment with tractable XDContainment and a mod- 
est NP-completeness result on the fragment using only / and []. We do not know how to 
extend the upper bound proof to include // or *. The results of this section are summarized 
in Table m 

We illustrate by a simple example that the presence of a DTD can complicate matters. 
Consider the DTD 

a — > ab \ e 
b ^ c 

c ^ e 

and the expressions p = a/a and q = ./ /b/c. Although p and q are seemingly unrelated 
and, in particular, it is not the case that every path matching p also matches q, it holds 
that each tree which respects the DTD and matches p also matches q. 

We remark that it can be tested in polynomial time whether, for a DTD d and a symbol 
a, there exists a tree t E L(d) with a vertex labeled a. Therefore, we assume in the following 
that each input DTD contains only use/tt/ symbols. In particular, for each d and each symbol 
a € Tid, there is a tree valid with respect to (d, a, S^). 

4.1. A tractable fragment. We start with a fragment in p. 

Theorem 4.1. XDContainment of XP{DTD, /, //)-expressions is in P. 

Proof. Let d be a DTD and p, q be expressions of XP(DTD, /, //). 

We first show how to construct a non-deterministic top-down automaton Ap which 
checks that, for a tree t, t \= p holds. To this end, let p = pi/ jp^j / ■ ■ ■ / /pk-, where in each 
Pi only the child axis is used. For each z, let pi contain child-axis location steps. 

Intuitively, Ap guesses a path in t which matches p and, for each node v on this path it 
maintains the lexicographically maximal (i, j) such that the path from the root to v matches 
the expression pi/ / • • • / /pi~i/ /pj, where consists of the first j location steps of pi (along 
the child-axis). 
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It should be stressed here, that Ap needs non-determinism only to guess the path. The 
computation of the pairs is completely deterministic and similar to the case of the 

standard string pattern matching automata j32j . The automaton, Ap enters an accepting 
state on the leaf u of the distinguished (guessed) path if and only if the computed pair for 
u is {k,ik). On all other leaves it takes an accepting state in any case. 

An automaton Aq which accepts all trees that do not match q can be constructed along 
the same lines. It computes a pair (i, j) with the same intended meaning as above, for every 
node V of the tree and takes an accepting state at all leaves that have not reached {k,ik). 
This automaton is actually deterministic. 

By combining Ap with Aq and the canonical non-deterministic automaton A^ which 
tests t \= d, we obtain an automaton which accepts all counterexamples to p q. As this 
automaton is of polynomial size in p, q, d and testing emptiness of non-deterministic tree 
automata is in ptime (Lemma ITHIl)), we obtain the stated upper bound. □ 

It should be mentioned that in [30] we claimed that XDContainment oiXF{DTD, /, //, 
*)-expressions is in p. Unfortunately, we were not able to extend the proof sketch given 
there into a complete proof. In fact, we conjecture that this problem is CONP-hard. 

4.2. Fragments in CONP. Next, we consider a fragment in CONP. It is open whether 
XP(DTD,/, []) is a maximal fragment whose complexity of containment w.r.t. DTDs is in 
CONP. 

Theorem 4.2. XDContainment for XF{DTD, /, []) is in conp. 

Proof. The obvious idea is to guess a tree t which matches P but not q. A complication 
arises from the fact that the smallest such tree t might be of exponential size due to the 
constraints from d. Thus, we give a non-deterministic algorithm CheckPnotq (d, a, P, q) 
which checks, given a DTD d, a non-terminal a of d, a set P = {pi, . . . ,Pn} of XP(/, [])- 
expressions, and an XP(/, [])-expression q, whether there is a tree t with root symbol a 
which conforms to d, matches all expressions in P but does not match q. But it does not 
explicitly construct such a tree. Clearly, invoking this algorithm with d, q, P = {p} and a as 
the start symbol of d checks whether p q. We note that we allow a set P of expressions 
as input for CheckPnotq because the algorithm uses such sets for recursive calls. 

Algorithm CheckPnotq makes use of two algorithms with slightly simpler tasks. Algo- 
rithm CheckP checks on input d,s,P whether there is a tree t with root s conforming to 
d which contains all the expressions from P. Algorithm Checknotq checks on input d, q 
whether there is a tree conforming to d with a root labelled by the root symbol of q which 
does not match q. 

We assume in the following that all labels of P and q occur in d and that d only 
contains symbols from which a tree can be derived. By s{p) we denote the root symbol of 
an expression p, i.e., the label of the root of t{p). A level 1 sub-expression of an expression 
p is an expression corresponding to a child of the root in t{p). Let / denote the overall 
number of depth- 1-nodes in expressions of P. 

The algorithms are given in Figure |2j The algorithms follow a top-down approach and 
work recursively. The correctness can be shown by induction on the number of recursive 
calls. Checknotq and CheckP are quite straightforward. 

An important point is that in CheckP it is sufficient to consider strings u of length at 
most {\d\ -|-l)(/ + 2). It can be shown by a simple pumping argument that if a tree matching 
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Checknotq(d, a, q) 

(Returns TRUE if there exists a tree with root a which does not match q) 

1. If s{q) / a return TRUE. 

2. If r(g) has only one node return FALSE. 

3. Guess a string u G d{a) of length < \d\ and a level 1 sub-expression q' of q. 

4. If h := s{q') does not occur in u return TRUE. 

5. Return Checknotq(d, 6, g'). 

CheckP((i, a, P = {pi , . . . , 

(Returns TRUE if there exists a tree with root a matching all pi) 

1. If some expression in P does not have the root symbol a return FALSE. 

2. Guess a string u € d{a) of length < {\d\ + + 2). 

3. For each i G {1, . . . , n}, guess a mapping fi from the level 1 sub-expressions 
of Pi to the positions of u. 

4. For each position j of n, which is in the image of at least one of the mappings 

fi 

(a) Let P' be the set of level 1 sub-expressions p with fi{p) = j. 

(b) Call CheckP(d, Uj ,P'). 

5. Return TRUE iff all the recursive calls return TRUE. 

CheckPnotq(d, a,P = {pi, . . . ,Pn}, q) 

(Returns TRUE if there exists a tree with root a matching all pi but not q) 

1. If some expression in P does not have the root symbol a THEN return 
FALSE. 

2. If s{q) / a return Chec'kP{d, a, P). 

3. If T{q) has only one node return FALSE. 

4. Guess a level 1 sub-expression q' of q. 

5. Guess a string u S d{a) of length < ([(i[ + !)(/ + 2). 

6. If b := s{q') occurs in u call CheckPnotq(ci, 6, 0, g'). 

7. For each i G {1, . . . , n}, guess a mapping fi from the level 1 sub-expressions 
of Pi to the positions of u. 

8. For each position j of n, which is in the image of at least one of the mappings 

/. 

(a) Let P' be the set of level 1 sub-expressions p with fi{p) = j- 

(b) Call CheckPnotq(d,■Uj,P',g')• 
9. Return TRUE iff all the recursive calls return TRUE. 



Figure 2: Algorithms Checknotq, CheckP and CheckPnotq used in the proof of Theorem 

P has a level with more children then there is a sub-sequence of these children which is not 
in the image of any mapping and can be removed without leaving d{a) . 

CheckPnotq is basically a combination of Checknotq and CheckP. It has to check that 
there is a path in q which does not match any path in the counter-example tree. Step 6 is 
needed to verify that this also holds in those parts of the tree which are not needed to fulfil 
P. 
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It remains to show that this (non-deterministic) algorithm works in polynomial time. 
This follows directly from the fact that for each node in t{p) and each node v in T(q) there 
is at most one recursive call of CheckPnotq in which v is the root of some expression in P 
(or q'). □ 

The next theorem follows directly from j35j . 

Theorem 4.3. (a) XDContainment for XP(/, []) is CONP-hard. 
(b) XDContainment for XP(//, []) is coNP-hard. 

Proof. In '35 , the following problem is shown to be CONP-hard. 

SIBLING CONSTRAINT IMPLICATION (SC IMP): 

Given: Regular expression r over alphabet S, a set S" C S, and a E E. 

Question: Does every string w G L{r) that contains all the symbols in S also contain the 

symbol a (denoted r |= S — > a)? 

We reduce SC iMPtoXP(/, []) andXP(//, []) . Thereto, assume given r, S = {si, . . . ,Sk}, 
and a. Construct the DTD d consisting of the sole rule start — > r, where start S, then 
start [si] • ■ ■ [sk] '^d start [a] iff start [.//si] • • • [-//sk] start [.//a] iff r ^ 5 — > a. □ 



4.3. Fragments in EXPTIME. When both the child and descendant axes are allowed, then 
adding filter expressions and wild-card or disjunction raises the complexity of XDContain- 
ment to EXPTIME. 

Although in 24_ it is shown that XDContainment is in exptime even for full navi- 
gational XPath by a reduction to propositional dynamic logic, we give here a simpler proof 
of the result for our downwards navigating fragment. 

First, we introduce two concepts. Let p be an XPath-expression. An expression is a 
sub-expression of an expression p if it is generated by an Ipath node in the derivation tree 
of p according to the grammar in Definition 12.31 

The self -closure of p, denoted by self (p), is inductively defined as follows: for a lo- 
cation step pi = axis :: cr[ei] • • • [e^] its self -closure, is self :: 0"[ei] • • • [e^]. For expressions 
Pi/P2 and pi [ their self-closure is self(pi)/p2 and self(pi) [ self(p2)5 respectively. 

Theorem 4.4. XDContainment for XP(/, //,[], *, |) is in exptime. 

Proof. We provide a translation to containment of unranked deterministic tree automata 
whose size is exponential in the input. By Lemma l2.9r 2). the latter is in EXPTIME. 

We first show that for each XP(/,//, [],*, |)-expression p, one can construct in expo- 
nential time an exponential size deterministic tree automaton Ap = {Q,T,d,S,F) such that 
Ap accepts a tree t if and only if i \= p- Here, is the finite alphabet associated to the 
given DTD d. The states of Ap are pairs (S, D) where S and D are sets of sub-expressions 
of p or the self-closure of sub-expressions of p. 

The intended meaning of the states is as follows. If pi £ S" at some vertex u of a tree 
t then \= pi. If pi € -D then there is some node w below v in t such that t^, \= Pi- So, 
S describes all expressions that hold at the current node, while D describes all expressions 
that hold at descendants of the current node. 

Set F = {{S,D) I p € S}. It remains to define the transition function for each 
6{{S, D), a). Recall that the corresponding DFA operates on strings of the form: (5i, Di) ■ ■ ■ (S 
Then S should contain exactly the XPath-expressions generated by the following rules. 



COMPLEXITY OF XPATH CONTAINMENT 



17 



• child :: cr[ei] • • • [e^j/p' G 5 iff tliere is an i < £ sucli tliat self :: cr[ei] ■ ■ ■ [ek]/p' € 

Si] 

• descendant :: cr[ei] • • • [ek]/p' G 5 iff ttiere is an i < ^ sucli tliat self :: cr[ei] ■ ■ ■ [e^j/p' G 
Si u A; 

• self :: cr[ei] ■ ■ ■ [ek\/p' G S" iff o" = a or o" = *, p' G S and G for i = 1, . . . , fc; 

• Pi [ P2 G 5" iff Pi G 5" or p2 £ and, 

• pi or p2 G 5" iff pi G 5 or p2 £ S. 

For tlie case without a sub-expression p' the first three rules are adapted in the obvious 
way. 

For each expression p, p G -D if p G 5^ U Dj, for some i. 

It remains to describe how a DFA B of exponential size can execute the above rules. 
There is a linear number of sub-expressions of p and self-closures of those. The DFA B 
keeps for each of them one bit in memory indicating whether the corresponding expression 
is in S or D. Initially none of them are. An expression is put in a set if one of the above 
rules fire. Every rule should be checked at every transition step. So, the size of each B 
is exponential in p. As Ap contains an exponential number of such DFAs, its size is also 
exponential. 

Let A(i be the exponential size deterministic automaton accepting d (cf. Lemma [2.9f 3)'). 
Then deciding whether p Q reduces to testing whether L{Ad) H L{Ap) C L{Aq). By 
Lemma 11212), the latter can be done in exptime. □ 

Theorem 4.5. XDContainment for XP(/, //, |) is hard for exptime. 

Proof. The proof makes use of a reduction from two-player corridor tiling. This is 
the extension of corridor tiling, used in the proof of Theorem 13.31 to two players. Let 
T = {D, H,V,b,i,n) be a tiling system. Again, D is a finite set of tiles; H,V D x D 
are horizontal and vertical constraints, respectively; b and t are n-tuples of tiles; and n is 
a natural number. There are two players (I and II) that place tiles in turn on an n x N 
board. On this board the bottom row is tiled with b. Player I starts on the first square 
of the second row from the bottom. Each player in turn places a tile on the next free 
square going from left to right and from bottom to top. While player I tries to construct 
a corridor tiling from b to i, player II tries to prevent it. If player II places a tile which 
is not consistent with respect to the horizontal and vertical constraints then player I can 
answer with a special tile '!'. Player I wins if a tiling is constructed satisfying the horizontal 
and vertical constraints with the top row tiled with i, or if she answers an inconsistent tile 
placed by II with '!'. We say that player I has a winning strategy if she can always win no 
matter how II plays. It is well-known that it is EXPTiME-complete to determine whether 
I has a winning strategy iSj . This result even holds if the number of tiles in a row is forced 
to be even. Thus we assume in the following that n is even. 

We encode strategies for player I as trees. For each position in the game in which II 
moves, the tree contains all possible moves of player II and, for each I-position, it contains 
only one move. Thus, such a tree encodes a winning strategy if and only if each path 
corresponds to a correct tiling or to a wrong move of II. 

To this end, we use symbols of the form {a,i), where a £ D and i G {1,2} plus some 
additional auxiliary symbols: $ to indicate the borders between rows, # to mark the end of 
a tiling and the "protest symbol" '!'. Inner nodes of the tree correspond either to moves of 
I or II. Nodes corresponding to I are labeled (a, 1) where o is the tile chosen by I in this 
move. They have one child for every tile corresponding to the possible next moves of II. 
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Nodes corresponding to II are labeled (a, 2) and have only one child which is the unique 
answer move of I according to her strategy or one of $, # or '!'. Nodes with label $ have 
one child corresponding to a move of I. Nodes with label ^ or '!' arc leaves. The root of 
the tree is labeled S and represents an empty (dummy) move which has to be answered by 
I. It has one child. On each path from the root, I-nodes and Il-nodes alternate (when we 
ignore the intermediate $ nodes). 

Now wc describe the reduction in more detail. Let D = {di, . . . ,dm}- We use the 
following DTD / which defines all possible strategy trees for player I: 

5^ (di,l) + --- + (d„,l) 

$^ (di,l) + --- + (d,„,l) 

and for every d £ D, wc have the rules 

(d,l)^(di,2)---(d^,2) 

{d, 2) ^ (di, 1) + • ■ ■ + {dm, 1) + # + $+! 

Note that # and '!' are the only terminal symbols. Thus, each path in the tree either ends 
with # or '!'. 

A derivation tree encodes a strategy tree (or game tree) for I. As the bottom and the 
top row are fixed we do not represent them in these trees, i.e., only intermediate rows are 
represented. We assume that the tiling consisting only of the top and bottom row is not 
valid. Therefore any strategy tree has to represent at least one row. 

We have to check whether there is a tree encoding a valid strategy tree for I. We will 
construct an expression q which selects a tree if and only if it does not encode a winning 
strategy for I. Thus, S q if and only if player I has no winning strategy. 

We define 

q '■= Qn tiles \ Qv \ iH \ q\, 

where the sub-expressions on the right hand side will be defined shortly. Intuitively, qn tiles 
expresses that some row has a wrong length (which can only be due to player 1), qv and 
qn express that some vertical or horizontal constraint, respectively, is violated by I, and q\ 
expresses that I used the protest symbol wrongly. Each of these sub-expressions identifies 
an error in the strategy tree. Hence, if every tree matches one of these expressions, every 
tree contains an error and no tree can be a valid strategy tree. 

Note that, although the expressions under consideration do not have the wild-card 
available, the disjunction of all alphabet symbols defined by the grammar is a kind of wild- 
card as the DTD assures that no other symbols occur in the tree. We denote the set of all 
pairs {a, i), where cr is a tile and i G {1, 2} by S. Further, we denote by 11$ the set S U {$}. 

Three types of errors can occur in a strategy tree: (1) the tree is of the wrong shape; (2) 
player I places an inconsistent tile, or (3) player I uses the symbol '!' although II placed 
a correct tile. 

A row does not contain exactly n tiles.: 

n-l 

gntiles:=-//^"+'l U I /^V($ I #)• 
i=0 

Recall that we use as a shortcut for di | • • • | d^. 



COMPLEXITY OF XPATH CONTAINMENT 



19 



Vertical Constraints are violated.: 

qv:=qi\qt\ \J .//(d', 1), 

(d',d)(^V 

where 

n 

i=l {b,4)<iV 

checks the vertical constraints w.r.t. 6, and 

n 

rf-=[j U .//m)/s"-v#, 

i=l (d,ti)^V 

checks the vertical constraints w.r.t. t. 
Horizontal Constraints are violated.: 

qH:= U .//(d',2)/(d,l). 

{d',d)(jl:H 

Wrong use of '!'.: 

n 

g, :=( U .//(rfi,2)/S^/(d2,l)/(ci,2)/!) U (U \J S/^^-' /{d\l)/{d,2)/\) 

{d2,d)&H (d',d)eH 

This expression takes care of the case that player I uses the protest symbol '!' 
although the last tile placed by player II was consistent with the vertical and 
horizontal constraints. The second disjunct in this expression takes care of the first 
row of the game (i.e., the second row in the tiling). □ 

By similar techniques, making use of the techniques of Lemma 3 in |27|, it can be shown 
that XDCONTAINMENT for XP(/, //, [], *) is hard for exptime. 

The outermost union can be handled as in Lemma 3 of |27j . Of course, the DTD has 
to be adapted accordingly. 

5. Containment in the presence of variables 

In this section, we study Boolean containment of XPath expressions which allow the 
comparison of attribute values. More precisely, in this section we consider XPath expression 
generated by the following grammar. 



Ipath ::= Istep | Ipath '/' Ipath [ Ipath 'j' Ipath 

Istep ::= axis nodetest ('['expr']')* 

axis ::= self | child | descendant 

expr ::= fexpr | vexpr | expr 'or' expr 

fexpr ::= Ipath 

vexpr ::= variable = attribute | variable ^ attribute 



A variable x is denoted as $x, an attribute a as @a. In this section, fragments without 
filter expressions still allow sub-expressions of the type vexpr. Furthermore, fragments 
without filter but with disjunction allow disjunctions of sub-expressions of type vexpr (cf. 
Theorem 15 . 3^ cl ) . 
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We consider two different semantics. The original XPath semantics and the existential 
semantics, introduced in jl4j . 

In the XPath semantics, variable bindings are defined in an outer context. In partic- 
ular, the value of an expression is defined with respect to a variable assignment p : X — > D 
where X is the set of all variables. We denote the truth value of a filter expression e rela- 
tive to a variable assignment p and a vertex v by E^(v, e) and the semantics of an XPath 
expression p hy [p]^. 

Formally, we extend Definition 12.41 as follows. Given a tree t, a node v, an attribute a 
and an assignment p, E^{v,$x = @a) is true if p{x) = Af(?;). Likewise, E^{v,$x ^ @a) is 
true if p{x) ^ Af (f ). The semantics of is then defined accordingly and we write t p 
if there is a vertex v of t such that (e, v) G IpJt- 

Deutsch and Tannen consider a different semantics which does not assume an 
external variable binding but rather allows a choice of values for the variables that makes 
the expression match. More formally, t \= p under existential semantics, if there is a 
variable assignment p and a vertex f of t such that t \=p p. We will write this as t p. 

We denote the allowance of variables under XPath semantics by xvars and under the 
existential semantics by evars; the presence of inequalities with variables is denoted by 7^. 
For instance, XP(/, //, xvars, 7^) denotes the XPath fragment with / and //, together with 
variable equality and inequality under the XPath semantics. 

We define Boolean containment of XP-expressions with attribute values correspond- 
ingly. More precisely, under the XPath semantics, p ^ q holds, if for every tree t and every 
variable assignment t \='^ p implies t \=p q. Under the existential semantics, p ^ q if for 
every tree t, if there is p such that t p then there is also a variable assignment vr such 
that t ^'^ g. 

In the first subsection, we show that adding variables under XPath semantics to the 
basic XPath fragment with disjunction results in a XContainment problem with pspace 
complexity. 

In [Tlj, it is shown that XCONTAINMENT for XP(/, //,[], *, |, evars) is Ilf -complete [TI] 
(Theorems 2.3 and 3.3). Further, they show that XContainment for XP(/, //, [], evars) is 
CONP-complete. We show in the second subsection that in the presence of inequalities these 
hardness results hold for even smaller fragments. 

Adding variables with XPath semantics to the basic fragment with disjunction but 
without wild-card gives a Il^'-complete XContainment problem. Surprisingly, adding wild- 
card to this fragment ends up in an undecidable XContainment problem. 

The results of this section are summarized in Table 

For a set X of variables an equality type e is an equivalence relation on X. Intuitively, 
e describes which variables have the same value. 

5.1. XPath semantics. We start with the PSPACE-upper bound. 

Theorem 5.1. XContainment for XP(/,//, [], |, *,xvars, 7^) is in pspace. 

Proof. We basically show that the problem can be reduced to the case without variables. 
Let p and q be two expressions with variables {xi, . . . , Xk} and let S' be the set of element 
and attribute names which appear in p or g (and which we assume to be disjoint). Let 
S = S' l±) {#}. Clearly, p Q q holds in general if and only if it holds for trees with element 
names from S. For each equality type e of the variables we construct two 
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expressions pe and qe in XP(/, //, [], j, *). By construction it then follows that p C g if and 
only if, for every equality type e, pe C q^. As we can cycle through all equality types e 
and construct each p^ and qe in PSPACE, and each single test will be doable in pspace, the 
complexity of the overall algorithm is pspace. 

Let us first consider the equality type e where all variables are pairwise different. With 
every tree t and every variable assignment p of type e, we associate a tree tp over the 
alphabet F = S U {xi, . . . , x^, none} as follows. We add, for each attribute a of a node v, a 
new child of v labeled by a which itself has a child which is labeled by one of xi, . . . , or 
with none depending on @a. More precisely, if @a equals some p{xi) then the grandchild 
of V is labeled by Xj otherwise by none. 

Correspondingly, we construct pe by replacing in p each subexpression Sxj = @a by 
./a/xi and each %Xi ^ @a by ./a/(xi | • • • | Xj_i | Xj+i | • • • | x^ | none). We construct qe 
from q in the same way. It is easy to see that, for each t, and each variable assignment p of 
type e, t \=f p if and only if tp \= Pe- 

Thus, it remains to test whether, for all trees of the form tp, tp \= pe implies tp \= qe. 
This can be done along the lines of the proof of Theorem IH.5I In particular, it is sufficient 
to consider only /(pe)-bounded trees. 

This completes the description of the algorithm for equality type e. If, for some other 
equality type, two variables get the same value then we can replace one of them by the other 
in p and in q. Hence, we get possibly fewer variables which are again pairwise different. □ 

We show that the pspace upper bound is tight in the general case and exhibit some 
lower bounds for restrictions of the formalism. 

Theorem 5.2. (a) XContainment for XP(/, [],xvars, 7^) is CONP-hard. 

(b) XContainment for XP(/, //,xvars, is coNP-hard. 

(c) XContainment for XP(/, //, |, *,xvars, 7^) is PSPACE-hard. 

Proof. The proofs of (a) and (b) are by reduction from the set of unsatisfiable 3SAT- 
formulas. To this end let = (^1 A ■ ■ ■ A(^fc be a 3CNF formula where each ipi is a disjunction 
hi V li2 V /i3 of 3 literals. Let xi, . . . ,Xm be the variables occurring in if. In both cases, 
intuitively we construct from ip an expression p that, describes all (tree representations of) 
possible truth assignments to the literals of (p. The expression q selects a truth assignment 
if it leaves at least one clause unsatisfied. Thus, ip is unsatisfiable iff p C g. 

More precisely, both reductions map p to expressions with variables y and yi, . . . ,ym- 
The value p{y) represents true, the other variables represent a truth assignment to xi, . . . , Xm 
in the obvious way: Xj is true iff p{yi) = p{y). 

In the trees we are interested in, the root a-attribute carries the data value correspond- 
ing to true and each literal lij is represented by one node Vij with an attribute a and 
label b. Expression p tests that the attribute values of these nodes are consistent with the 
truth assignment induced by the yf. this is checked by variable expressions v{lij) which are 
%yi = @a if lij = xi and $yi ^ @a if kj = -ix;. 

(a) In case of XP(/, [],xvars, 7^), the trees selected by p have one path of length 3, for each 
clause. To this end, let, for each i, pi be the sub-expression 

Pi = b[viki)]/b[v{li2)]/b[vihs)], 

and let p be 

p = b[$y = @a]\pi]---[pk]. 
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Thus, t p guarantees that t has, for each clause (pi, a path in which the j-th node 
(j = 1, 2, 3) has an attribute value consistent with the truth value of lij, as induced by 
P- 

Thus, formula (f is unsatisfiable, if every tree which "passed" p has a path of length 
3 in which all induced truth values are false. To this end, we define 

q = b[$y = @a\/b[$y + @a]/6[$y @a]/6[$y / @a]. 

(b) Recall that in XP(/, //, vars, 7^) sub-expressions of the forms [$?/ = @a] and [$y 7^ @a] 
are allowed. As XP(/, //, vars, 7^) can not refer to branches of a tree, in this case, the 
vertices Vij have to be organized in a linear fashion. Basically, in the construction of 
(a), we replace the general [] subexpressions in p by using // in q. To this end, the 
trees matched by p need to have a path of length 4A; with label pattern (cbbb)* . The 
i-th. subpath labeled cbbb corresponds to ipi. 

Thus, 

Pi = c/b[v{la)]/b[v{k2)]/b[v{li3)] 

p = b[$y = @a]/pi/---/pk 

q = b[$y = @a\//c/b[$y / @a]/b[$y / @a]/b[$y / @a\ 

By a similar reasoning as in (a) it follows that ip is satisfiable if and only if p ^ g. 

(c) We use basically the same construction as in Theorem 13. 31 Let D = {ui, . . . , Cfc} be the 
alphabet used in that construction and assume without loss of generality that k = 2\ 
for some I. We use attributes ai, . . . , a; and one variable x to encode the symbols of D. 
More precisely, ai, . . . , and x represent a^,, if, the z-th digit of 6 is 1 exactly if x = Oj. 
The remaining symbols $ and if^ are still represented by labels. In the expressions p and 
q the element tests are replaced by the wild-card symbol together with the respective 
attribute comparisons. □ 



5.2. Existential semantics. Now we turn to XContainment in the context of existential 
semantics for variables. In some cases the complexity does not change as to compared with 
the XPath semantics, but in others it raises considerably, even leading to undecidability for 
XP(/,//, [],*,|,evars,7^). 

Theorem 5.3. (a) XCONTAINMENT for XP(/, [],evars) is CONP-hard. 

(b) XContainment for XP(/, [],evars, is Il^-hard. 

(c) XContainment for XP(/, |,evars) is n^^-hard. 

Proof. The proofs for (a) and (b) are similar. Both are reductions from containment for 
Boolean Conjunctive Queries (BCQs). In (b) inequalities are allowed, in (a) not. As con- 
tainment of BCQs and BCQs with inequality is hard for CONP j6| and 11^ j34j, respectively, 
(a) and (b) then follow. 

We first describe how we represent a relational database D by a tree t^- The root is 
labeled with S and for every relation R m. D and every tuple (di, . . . , d^) in i? it has a child 
labeled R with n attributes @1, . . . , @n, where, for each i, @i has the value dj. 

A BCQ is a conjunction of relational atoms R{xi, . . . ,2;^), and, in case of (b), inequal- 
ities X ^ y. Note that atoms of the form x = y can be eliminated as each variable must 
occur in a relational atom. An atom R{xi, . . . , x^) can be represented by the subexpression 
Pi = R[$xi = @1] • • • = @k]. 
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We can not represent an inequality a; 7^ y by an expression [$x 7^ as variables can 
only be compared with attributes. Nevertheless, as x must occur as the i-th entry in a 
relational atom, we add [y 7^ @i] to the expression for L. 

We illustrate the construction with an example. For instance, if Q is E{x, y), E{y, z),x ^ 
z then pq is 

S[E[$x = @l][$y = @2][$z / = @l][$z = @2]]. 

Clearly, for a database D, D \= Q if and only if t^) \=3 pq. On the other hand, from each 
t (even if it is not of the intended form) we can define, in a straightforward manner, using 
all correctly encoded tuples, a database D{t) such that D(t) \= Q if and only if t \=3 pq. 

Thus, for BCQs Qi and Q2 it holds Qi C Q2 if and only if pg^ C pg^, as required. 

The proof of (c) is by a reduction from V3-3SAT |31j . Note that the fragment XP(/, | , evars) 
allows the use of disjunction of variable-attribute comparisons. 

Let Lp = Vxi • • • Xm^yi ■ ■ ■ ym^, where ^ = ^1 A ■ ■ ■ A is in 3-CNF with variables from 
{xi, . . . , Xm, yi-, ■ ■ ■ iVm}- Intuitively, the V3-structure of a formula will be mimicked by a 
for all trees there is a variable assignment statement. Hence, the values for xi, . . . ,Xm will 
be encoded in the trees, the values for yi, . . . , y^a will be given by the variable assignment. 

Each tree satisfying p will have two root attributes @T and @F which represent the 
truth values true and false, respectively. Note that, as there are no inequalities available, 
there will be no guarantee that the values of T and F are different. We only make use of 
the node label h. With every path of length m which starts immediately below the root we 
associate a (partial) truth assignment as follows: if the c-attribute of the i-th node equals 
the value of the attribute T, we set p{xi) = T, if it is F then p{xi) = F. 

The expression p is designed to guarantee the existence of such a path. To this end, 
p is b[$zi = @T][$zo = @F]/pi/ ■ ■ ■ /pm, where, for each i, pi = b[%zi = @c or %zq = @c]. 
Intuitively, the variables zq and zi are used to transport the values true and false in the 
tree. 

So far, each tree t with t p induces at least one truth assignment for xi, . . . , Xm- 
Expression q checks that 

• zi and zq take the values @T and @F at the root, respectively, 

• each (XPath) variable Xj has a truth value corresponding to the value of the i-th 
node of some path of the tree, 

• each (XPath) variable yi is either @T or @F, and 

• the induced truth assignment makes 9 true. 

To this end, q = b[ei] ■ ■ ■ [e„][$zi = @T][%zo = @F][$yi = @T or %yi = @F] • • • [%ym = 
@T or $ym = @-F]/6[$xi = @c]/ ■ ■ ■ /b[$Xm = @c], where each is a variable expression 
corresponding to 9i. As an example, if 9i is X2 V ys V -1X1 then ei is [$X2 = @T or $7/3 = 
@r or $xi = @F]. 

Now it is easy to see that if holds iff p C g with respect to the existential semantics. 
Note in particular that trees for which @T = @F at the root fulfill q whenever they fulfill 
p. □ 
Theorem 5.4. XContainment for XP(/,//, [], |, evars, 7^) under existential semantics is 

in nf. 

Proof. We show that for XP(/, //, [], |, evars, 7^)-expressions the following holds. 

(a) p ^ q if and only if there is a tree t of size polynomial in |p| + \q\ such that t p but 
t ^3 q, and 
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(b) t\=BP can be tested in NP. 

Hence, the algorithm Guess a tree t of polynomial size and check that t \= p but t Y= q is a 
S2-algorithm for the complement of XContainment. 

To prove (a), let p and q be expressions and let pi| • • • \pm and qi\ - • - {qn be the DNF 
of p and q, respectively. As the disjuncts do not contains disjunction themselves they can 
again be represented as tree patterns with additional constraints reflecting the equalities 
and inequalities between variables and attributes. 

Clearly, p <^ q if and only if for some i, pi 2 <?• Hence, as i can be guessed, in proving 
(a) we can restrict to the case where p does not contain | . 

We call a tree t {p, g)-canonical if the following conditions hold. 

• The tree structure of t is obtained from the tree pattern t{p) by replacing each //- 
edge by a path of length two with two child edges and a new intermediate #-labeled 
node where # is a label neither occurring in p nor q. Note that the number of 
vertices of t is at most twice the number of vertices of t(p). 

• The attribute values in t are from the set {0, . . . ,mk}, where m is the number of 
vertices in t and k is the number of attributes occurring in p or q. 

Let S{p, q) denote the set of all (p, g)-canonical trees. Note that, as the data values are 
bounded by + \q\)ra each of these trees can be encoded by a string of polynomial size. 

We show next that, whenever p % q for an expression p from XP(/, //, [],evars, 7^) and 
an expression q from XP(/,//, [], |,evars,/), there is a tree t G S{p,q) that matches p but 
not q. 

Therefore let p ^ q be witnessed by a tree t' not necessarily from S{p, q). Hence, t' p 
but t' ^3 q. Let e be a homomorphism from t{p) to t'. Let ai, . . . , a;, I < k, be the pairwise 
different attribute values of the vertices in e(r(p)). 

We construct t as follows. Its structure is obtained from t{p) as above by replacing 
//-edges with new nodes labeled We call a vertex v oit that is already in r(p) an original 
vertex and write p{v) for its corresponding vertex in r(p). An original vertex v oi t inherits 
its attribute values from e{p{v)) as follows. If attribute b of e{p{v)) has value Oj then v gets 
the attribute value i. The attributes of the other nodes get the value 0. 

Let u and u' be (not necessarily distinct) original vertices in t and let b, b' be two 
attributes. Then the 6-attribute of u is different from the ^'-attribute of u' if and only if 
the 6-attribTitc of e{p{v)) is different from the ^'-attribute of e{p{u')). 

Clearly, t € S{p, q) and t |= p via the obvious homomorphism. It remains to show that 
i ^ g. Assume otherwise. Hence, for some j, t \= qj. Let e' be a homomorphism from T{qj) 
to t. As q does not contain the symbol # and there are no wild-cards, the image of T{qj) 
under e' only contains original vertices of t. As these vertices have the same relationships 
within each other as their corresponding vertices in t' we can conclude that t' also matches 
qj. This concludes the proof of (a). 

To show (b), we remark that whether t p for an expression p in XP(/, //, [], |, evars, 7^) 
can be tested as follows. First, a disjunct pi of the disjunctive normal form of p is guessed. 
Next, a homomorphism from T{pi) to t and a value assignment for the variables of Pi are 
guessed (with values < \pi\) and it is checked whether all conditions hold. □ 

We note in passing that for variables with existential semantics even query evaluation 
is hard. 

Proposition 5.5. (a) Evaluation of Boolean XP(|, evars)-expressions is NP-hard. 
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(b) Evaluation of Boolean XP(/, [], evars)-expressions is NP-hard. 

Proof, (a) The proof is by reduction from 3SAT. Let (p = AILi fi be a 3CNF formula with 
variables from {xi, . . . , Xm}- Let t be the tree consisting of a single vertex with attributes 
@T = 1 and @F = 0. Let p be the expression [$xi = @T or $xi = @F] ■ ■ ■ [$Xm = 
@T or $Xm = @F]py,, where = p^p^ ■ ■ -pip^ and each p^. represents clause ipi. E.g., if 
Vi = (^3 V -ixs V xi) then p^. is [8x3 = @T or 8x5 = @F or $xi = @T]. Clearly, ip is 
satisfiable iff t p. 

(b) This follows easily from the correspondence between XP and BCQ as explained in the 
proof of Theorem 15.31 and the fact that BCQ-evaluation is hard for NP. □ 

The following theorem shows that in the setting of variables with existential semantics 
the wild-card has a strong impact. 

Theorem 5.6. XContainment for XP(/,//,*,|,evars,/) is undecidable. 

Proof. We use a reduction from Post's Correspondence Problem (PCP) which is well-known 
to be undecidable JH]- An instance of PCP is a sequence of pairs (xi, yi ),..., (x„, 
where Xj, yi € {a, b}* for i = 1, . . . ,n. This instance has a solution if there exist m G N and 
ai, . . . , Om G {1, • • • ) '^l such that x^i • • • x^^ = Uai ■ ■ ■ Vam- We construct a DTD d, and 
two XPath expressions pi and p2 such that pi p2 iff the PCP instance has a solution. 
At the end of the proof, we will explain how we can get rid of the DTD. 

We consider unary XML-trees, that is, strings. They are roughly of the form u%v, where 
$ is a delimiter and n, v are strings representing a candidate solution (x^^ , • • • , Xq,^ ; y^j^ , • • • , y/3„ 
for the PCP instance in a suitable way. To check whether such a candidate is indeed a so- 
lution, we roughly have to check whether 

(1) for each z, Oj = /Jj, that is, corresponding pairs are taken; and 

(2) both strings are the same, that is, corresponding positions in x^j • • • Xq,^ and ■ ■ ■ ya„ 
carry the same symbol. 

To check (1) and (2), we make use of a double indexing system based on the values of the 

attributes block and position of the nodes in u and v. We explain the intuition behind our 

reduction by means of a small concrete example. 

The DTD will define strings of the form S &. For instance, the candidate solution 

X1X2; 2/12/2 where xi = ab, X2 = 6, 2/1 = a, and 2/2 = bb will be represented as a concatenation 

of X- and y-blocks as follows: 

S X l(x) a(x,l,l) fe(x, 1,2) X 2(x) b{x,2,l) # 
block 1 2 

position 12 3 

Y 1(2/) a(2/,l,l) Y 2{y) %, 2, 1) 6(2/, 2, 2) & 
block 1 2 

position 1 2 3 

Here, the block and position rows indicate the values of the respective attributes. The 
symbols X and Y indicate the beginning of an x- and 2/-block, respectively. The symbol 
l(x) means that xi is picked; and, a(x, 1, 1) 6(x, 1, 2) encode that xi is the string ab. More 
precisely, ct(x, i, j) encodes that the j-th position in the string Xj is a. We need this involved 
encoding as we will define a DTD that can only produce valid sequences of blocks. The 
attributes "block" and "position" make up the double index system as will become clear 
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further on. We refer to blocks corresponding to encodings of an Xi and yi as X-blocks and 
y-blocks, respectively. If a block corresponds to Xj or t/i we say that its number is i. 

Let, for each i < n, Xi he a\ ■ ■ ■ aj,. and yihe S{ - ■ ■ S^.. Then the DTD d consists of the 
productions 

5 ^ X 

X l{x)\...\n{x) 
# ^ Y 

Y ^ l{y)\...\n{y) 

6 ^ £ 

and further sets P{xi) and P{yi) of productions, for each i < n. Here, P{xi) consists of the 
productions i{x) cr\{x, i, 1), and for j := 1, . . . , fcj — 1, (T* (x, — > a'j^i{x, i,j + 1), and 
a'i^.{x,i, ki) — > X I Analogously, P{yi) is the set of productions i{y) — >■ 6\{y,i, 1), and for 
j :=l,...,£i-l, 6]{y,i,j) 5]j^^{y,i,j + 1), and 5\,{y,i,li) ->Y \k. The start symbol 
is S. Every X and Y has an attribute block; every a and h has an attribute position. 

Formally, a tree Su^v&i is syntactically correct if u and v contain the same number 
of blocks and it fulfills the following condition. For z G {u, v}, let block(z) be the list 
consisting of the block attribute-values of the nodes in z and let position(2;) be the list 
consisting of the position attribute- values of the nodes in z. Then it should be the case that 
block('u) = block(i;) and position('u) = position(i;). A syntactically correct string Su$v&l 
represents a solution of the PCP instance, iff the block numbers of corresponding blocks 
are the same and the values (a or b) of corresponding positions are the same. 

Let p be the XP-expression S and let d be as above. We next construct q' in such a way 
that it selects the root of an XML-tree if and only if it is not syntactically correct or does 
not represent a solution. As p defines all inputs, p q' if and only if the PCP instance 
has no solution. 

In the following, if z is the string abab then we use z as a shorthand for a/b/a/b. Further, 
we denote the string generated by the grammar from Xi and yi by and y^, respectively. 
That is, for xi = ab, xi = a(x, 1, l)6(x, 1, 2). 

(1) The block index is wrong. 

(a) the block value of the first X in u differs from the block value of the first Y in 
v: 

S/X[$d = @block]//#/Y[$d ^ ©block]. 

(b) the block value of the last X in u differs from the block value of the last Y in 

v: for each i,j € {1, . . . , n} we have 

S//X[$d = miock]/i{x)/xi/#//Y[$d ^ ©block] /j{y)/yik 

(c) two X-block values arc the same: 

S//X[$d = ©block]// X[$d = ©block]/ /# 

(d) two y-block values are the same; 

S//#//Y[$d = ©block] //Y[$d = ©block] 

(e) two successive X-block values are not successive in v: for all i,j G {1, . . . ,n} 
we have 

S//X[$d = ©block]/i{x)/xi/X[$e = ©block]// 

i^//Y[%d = ©block]/ j{y)/yj/Y[%e ^ ©block]. 

(2) The position index is wrong. This is done in an analogous fashion. 
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(a) the first position in u differs from tfie first position in v: 

S/X/ * / * [$d = ©position]/ /Y/ * / * [$d 7^ ©position]. 

(b) the last position in u differs from the last in v. 

S/l * [%d = ©position]/^// * [$d 7^ ©position]/ k 

(c) two X-position vahies are the same: 

S// * [$d = ©position]// * [$d = ©position]/ /# 

(d) two l"-position values are the same; 

S//#// * [$d = ©position]// * [$d = ©position] 

(e) two successive X-position values are not successive in v: we have to deal with 
several cases as the successive positions can occur in the same block or in 
successive blocks. 

(i) the X-positions occur in the same block, the K-positions occur in the 
same block: for all i,j G {1, . . . ,n}, k & {1, . . . , \xi \ — 1} , £ & {I, . . . , \yj\ — 

1}: 

S//X/i{x)/ ^^-'^ / * [$d = ©position]/ * [$e = ©position] 

//Y/3{y)// / *[%d = ©position]/ * [$e ^ ©position] 

(ii) the X-positions occur in successive blocks, the ^-positions occur in the 
same block: for all i,j G {!,..., n}, £ G {1, . . . , — 1}: 

S//X/i{x)/ 'kI'^*!-^ / *[%d = ©position]/ X/ * / * [$e = ©position] 

//Y/3{y)// / *[%d = ©position]/ * [$e ^ ©position] 

(iii) the X-positions occur in the same block, the ^-positions occur in succes- 
sive blocks: for all i,j G {1, . . . ,n}, k ^ {1, . . . ,\xi\ — 1}: 

S//X/i{x)/ / *[%d = ©position]/ * [%e = ©position] 

l/y/jiy)// / *[%d = ©position]/Y/ * / * [$e 7^ ©position] 

(iv) the X-positions occur in successive blocks, the y-positions occur in suc- 
cessive blocks: for all i, j G {1, . . . , n}: 

S//X/i{x)/ * [$d = ©position]/ X/ * / * [$e = ©position] 

//y/j{y)// / *[%d = ©position]/Y/ * / * [$e 7^ ©position] 

(3) w does not represent a solution: 

(a) The block number for some block in u is different from the corresponding block 
in v: for all j G {1, . . . , n} with i ^ j: 

S//X[$d = ©block]/i{x)//Y[$d = ©block]/ j{y) 
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(b) The symbol (a or b) in u is different from the corresponding symbol in v. 
Thereto, we have the following expressions: for all i,j € {l,...,n}, k € 
{l,...,\x,\},iG{l,...,\yj\} 

S//a{x,i, k)[$d = ©position]// #//b{y,j,£)[$d = ©position] 

and 

S/ /b{x,i,k)[$d = ©position] //i^//a{y,j,£)[$d = ©position] 

Clearly, w is not syntactically correct or does not represent a solution if and only if one of 
the above conditions holds. 

To get rid of the DTD, we add disjuncts to q' that capture violations of the DTD. 
However, to this end we need to express that children of a node cannot have a certain label. 
As we can not express this kind of negation directly, we encode labels of nodes by equality 
types of attribute values. So, let L := {£i, . . . , im} be the set of all the labels we need. 
Every node now has m attributes ai, . . . ,am- If for a node, j > 1 is the largest number 
such that the value of oi equals the value of aj then the node is considered as labeled with 
ij. One can match a node labeled with ij by checking the corresponding equality type: for 
instance, by an expression of the form 

= ©ai][$X2 / ©ai] ■ ■ ■ / @ai][$Xj = @ai]. 

Clearly, when using this approach we can express that a certain node is not labeled by a 
certain label. For every rule a — > 61 | • • • | 6^ in the DTD we add the disjunct //a/c to 
q' where c £ L \ {bi, . . . ,bk}- When we write / /a/c, we of course mean XPath expressions 
taking labels into account as specified in the manner above. Let q be obtained from q' by 
adding all the disjuncts from the DTD and replacing all references to labeling by references 
to encoding with attributes. Note that now we allow arbitrary trees as well. 

It remains to argue that p ^ q iS the PCP instance has a solution. When there is a 
solution to the PCP then clearly the encoding of this string will match p but not q. Suppose 
that there is a tree that matches p but not q. This means that no error occurs on any path 
in the tree. Therefore, every path is an encoding of a solution to the PCP instance. □ 

6. Discussion 

The article studied the complexity of the containment problem for a large class of XPath 
expressions. In particular, we considered disjunction, DTDs and variables. Unfortunately, 
the complexity of almost all decidable fragments lies between CONP and exptime. On the 
other hand, the size of XPath expressions is rather small. As pointed out, Deutsch and 
Tannen, and Moerkotte already obtained undecidability results for XPath containment. 
We added modest negation (7^) and variables with the existential semantics. In j3Uj a 
corresponding result is shown in the presence of node-set equality. However, the reduction 
employs XPath expressions with absolute filter expressions which fall outside the scope of the 
present paper. It would be interesting to have a precise classification of which combination 
of features makes the problem undecidable. 

Although the complexity has been settled for a lot of fragments of XPath with child 
and descendant axes, the picture is by no means complete. A particular case that remains 
open is the case of XP(DTD, /, //, *). 
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Corresponding results in the presence of other axes remain to be investigated. Besides 
the general upper bound of Marx j24j for navigational XPath with all axes, few precise 
results are known. Work on satisfiability of XPath with the sibling axis has been done by 
Fan and Geerts [T^ . 
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